Inverse folding of RNA II
نویسندگان
چکیده
In the Oxford Summer School in Computational Biology 2011, one of the projects was aimed at creating an improved method for inverse RNA folding based on a genetic algorithm (GA) approach. This turned out to be sufficiently successful [16] and inspiring, that we will extend on it in this year’s summer school. The inverse RNA folding problem consists of finding an RNA sequence that, as a molecule, folds into a target structure, or structures. We will focus exclusively on what is known as the secondary structure of the molecule, which is the set of base pair interactions formed in the structure. The secondary structure can be inferred from the sequence with reasonable precision, with several paradigms available. This allows the inverse folding problem to be addressed completely in silico (i.e. computationally), using predicted structures as a proxy for the true structure of a designed sequence. The aim of this follow-up project is to address at least some of the following points: 1. enhance the existing GA with functionality for multi-objective fitness and objective functions 2. extend the method to utilise more folding paradigms to create more robust designs 3. extend the software with a graphical user interface and parallelism 4. improve the sequence initialisation and search methods to be able to design perfect sequences for more targets 5. extend the target specification to include structures with pseudoknots, nucleotide constraints for specific positions as well as overall distributions, less specific targets with regions where any structure is allowed or even as more abstract shapes, and possibly even two or more molecules forming complex structures 6. improve complexity results for the inverse RNA folding problem It is not realistic to assume that all these points can be addressed in full, or possibly even at all. The list should work as a source of inspiration, from which you can choose the most relevant and interesting elements to work on. However, the ordering does reflect our a priori prioritisation. This project description first describes the basic background for the inverse folding problem, including a brief summary of our method. It then outlines the points for extensions in more details, also discussing the importance of each proposed extension.
منابع مشابه
Relation Between RNA Sequences, Structures, and Shapes via Variation Networks
Background: RNA plays key role in many aspects of biological processes and its tertiary structure is critical for its biological function. RNA secondary structure represents various significant portions of RNA tertiary structure. Since the biological function of RNA is concluded indirectly from its primary structure, it would be important to analyze the relations between the RNA sequences and t...
متن کاملMulti-Objective Genetic Algorithm for Pseudoknotted RNA Sequence Design
RNA inverse folding is a computational technology for designing RNA sequences which fold into a user-specified secondary structure. Although pseudoknots are functionally important motifs in RNA structures, less reports concerning the inverse folding of pseudoknotted RNAs have been done compared to those for pseudoknot-free RNA design. In this paper, we present a new version of our multi-objecti...
متن کاملMODENA: a multi-objective RNA inverse folding
Artificially synthesized RNA molecules have recently come under study since such molecules have a potential for creating a variety of novel functional molecules. When designing artificial RNA sequences, secondary structure should be taken into account since functions of noncoding RNAs strongly depend on their structure. RNA inverse folding is a methodology for computationally exploring the RNA ...
متن کاملDesign of RNAs: comparing programs for inverse RNA folding.
Computational programs for predicting RNA sequences with desired folding properties have been extensively developed and expanded in the past several years. Given a secondary structure, these programs aim to predict sequences that fold into a target minimum free energy secondary structure, while considering various constraints. This procedure is called inverse RNA folding. Inverse RNA folding ha...
متن کاملINFO-RNA - a fast approach to inverse RNA folding
MOTIVATION The structure of RNA molecules is often crucial for their function. Therefore, secondary structure prediction has gained much interest. Here, we consider the inverse RNA folding problem, which means designing RNA sequences that fold into a given structure. RESULTS We introduce a new algorithm for the inverse folding problem (INFO-RNA) that consists of two parts; a dynamic programmi...
متن کامل